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t-H , In this paper, we study the complete graph K n with n vertices, where we attach an 

(N ■ i.i.d. weight to each of the n(n — l)/2 edges. We focus on the weight W n and the number of 

I edges H n of the minimal weight path between vertex 1 and vertex n. 

p/| It is shown in [6] that when the weights on the edges are independent and identically 

p j distributed (i.i.d.) with distribution equal to E s , where s > is some parameter and E has 

an exponential distribution with mean 1, then H n is asymptotically normal with asymptotic 
^3 ' mean s log n and asymptotic variance s 2 log n. In this paper, we analyze the situation when 

the weights have distribution E s , s > 0, where the behavior of H n is markedly different as 
Q H n is a tight sequence of random variables. More precisely, we use Stein's method for Poisson 

approximation to show that, for almost all s > 0, the hopcount H n converges in probability to 
I \ the nearest integer of s + 1 greater than or equal to 2, and identify the limiting distribution of 

■ the recentered and rescaled minimal weight. For a countable set of special s values denoted by 

lO ' S = {sj}j>2, the hopcount H n takes on the values j and j + 1 each with positive probability. 

(N . 
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.9 s . , 1 Introduction 



One of the central themes of modern discrete probability has been the study of the effect of 
random edge disorder on various properties of the underlying network. The base network itself 
could be deterministic, e.g. a large finite box in the lattice or the complete graph on n vertices, 
or random, e.g. the giant component of the Erdos-Renyi random graph or the configuration 
model. Each edge is assigned a random edge weight, whose interpretation varies depending 
on the context. One can think of this weight as the cost in traversing the edge, yielding first 
passage percolation type models. Alternatively, one could think of the underlying graph as a set 
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of resistors connected together and the assigned weights as resistors, yielding a random resistor 
network, or as capacities on edges and the underlying graph as a flow carrying network, entrusted 
with carrying flow (commodities, information etc) between various parts of the network. 

One graph model that has resulted in many problems of fundamental interest is the complete 
graph K n on n vertices with random edge weights. In the various contexts mentioned above, this 
model both gives rise to very interesting conjectures as well as generates many new techniques 
and insights in probability theory that can then be applied in a number of other contexts. While 
providing a complete list of references of the various models that have been studied in this context 
would be impractical, we direct the interested reader to [14] for one of the first refined results in 
first passage percolation in this context, [10] for an analysis of the cost of the minimal spanning 
tree, [11] for an analysis of random electrical networks on the complete graph, [2] for a study of 
the random assignment problem, [4] for an analysis of the multicommodity flow problem, and 
the survey paper [1] where a number of other examples are analyzed via the powerful local weak 
convergence method. 

Let us now focus on the particular problem dealt with in this study and the motivations behind 
the problem. Suppose we start with a connected graph Q n (deterministic such as K n or random) 
on n vertices. Suppose each edge is assigned a random positive edge weight u e . We shall assume 
that the weights are independent and identically distributed over edges with some distribution 
F, with density /. Fix two vertices (say chosen uniformly at random from Q n ) and let us denote 
them by 1 and n say. For any path V between the two vertices, let the weight of the path w(V) 
be defined by 

i.e. the sum of weights of the edges in the path. The optimal or minimal weight path (which 
is unique since the edge weights have a density) is the path that minimizes the above weight 
function. In the study of random systems, this regime is often called the weak disorder regime, 
while probabilists know this problem as "first passage percolation" . The mental picture one can 
have is that the network is entrusted with carrying flow between various parts of the network, 
and the way it performs this duty is via routing flow through optimal paths. We shall defer a 
more extensive discussion of the relevant literature to Section 3. 

Another regime which is of tremendous interest is the strong disorder regime. Here the weight 
of a path which we shall denote by w st as the maximum or minimum weight of all edges in the 
path. In these two cases we shall denote the weight functions as 

«wCP) := maxit e (1.1) 

and 

w min (V) := minu e . (1.2) 

eeV 

In both situations, one is the interested in properties of the path which minimizes the above weight 
function. One is also interested in formulating a model, depending on a real valued parameter, 
the "inverse parameter" , (in this study denoted by s S R) which interpolates between these two 
models. One can then study questions such as phase transitions, where these is a change in the 
behavior of the system from the weak disorder regime to the strong disorder regime. Given the 
set of edge weights u e , one method of doing this is as follows: assign each edge a cost u% where 
s £ R is a real valued parameter. With these edge weights, suppose that, as before, we consider 
the weak disorder regime, so that the weight of a path V is 

eeV 
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Then 

(a) Original model: s = 1 is our original model. 

(b) Graph distance: s = gives us the graph distance between the chosen vertices in the graph 

Qn- 

(c) Strong disorder, max edge weight: As s — > +00 gives us the strong disorder regime where 
the weight of a path is given by (1.1). This is also called the minimal spanning tree regime as 
the optimal path between the two vertices is the same as the path in the minimal spanning tree 
on Q n with edge weights u e . 

(d) Strong disorder, min edge weight: s —> —00 gives us the strong disorder model where 
the weight of a path is given by (1.2). 

Thus, this model allows us to interpolate between various regimes of interest. We shall denote 
the optimal path by 7 3 pt( s )- Given a particular base network Q n and edge weight distribution 
u e , two statistics are of paramount interest: 

(i) Minimal weight: This is the actual weight of the optimal path, namely W n = Yleev t (s) u e- 

(ii) Hopcount: This is defined as the number of edges in the optimal path 'Popt('S). We shall denote 
this random variable by H n (s). 

Aim of this paper: In this paper, we shall specialize to the case where the graph Q n is the 
complete graph K n and each edge originally has edge weight u e ~ Exp(l). We shall study the 
case where s < 0. The case where s > has been solved in [6], where it was proved that, for 
s > 0, 

H n (s) - slog n d) z - ig . 
\J s 2 log n 

where Z ~ N(0, 1) denotes a standard normal distribution, while there exists a constant A = 
A(s) > and a non-degenerate real-valued random variable S(s) such that 

W n -jlogn A 3(a) (1.4) 

In this study, we shall derive asymptotics for the two random variables of interest namely W n (s) 
and H n (s) as n — > 00, and see that the behavior in the case when s < is markedly different. 
Without further ado let us now state our main results and defer a further discussion to Section 
3. 



2 Results 



Before stating the main result, we need some notation. We will study the complete graph K n with 
i.i.d. edge weights £7/.,, 1 < i < j < n, on the edges of K n . Thus, compared to the discussion in 
the previous section, we have replaced s there by — s here, and we shall study the s > regime. 
For fixed s > 0, define the function 

9s (x)= XS+ x>2. (2.1) 

(x — l) s 

Observe that, for < s < 1, the function g s (x), x > 2, is increasing, while for s > 1, the function 
is strictly convex with unique minimum at x = s + 1. We shall be interested in minimizing 
this function only on the space Z+, the set of positive integers. Then, there is a sequence of 
values s = Sj, j > 2, for which the minimum integer of g s is not unique. From the equation 
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9s(j) = 9s(j + 1) an d the bounds j — 1 < s < j, it is not hard to verify that 



Si 



io g (i+r 1 



log(l + (j 2 - l)- 1 ) 



jpn J = 2,3,.... (2.2) 



We will need to deal with these special points separately. When s ^ {s2, • • •} then there is a 
unique integer which minimizes the function g s (x) on Z+. Let S = {,§2, ■ ■ •}• 

Let us now state the main theorems. Below and the rest of the paper, for notational simplicity, 
we define p = 1/s, for s > 0, and, for x S R, we denote by \x\ the largest integer smaller than or 
equal to x and by \x] the smallest integer larger than or equal to x. 

Theorem 2.1 (Hopcount and weight asymptotics) For any fixed s > with s £ S, let 
k*(s) S {[s + lj, \s + 1]} denote the unique integer that minimizes the function defined in (2.1). 
Then, 

(a) the hopcount H n = H n (s) converges in probability to k*(s), i.e., for n — > oo 

F(H n = k*(s)) -> 1; 



(b) the optimal weight W n , properly normalized converges in distribution as n — > oo, 
k ~ l n \s+i frxr Ss(k) \ k-1 p(k - 1) log g s {k) 

-)■ exp(-ajfceV(fc - 1)!), 
where k = k*(s), and the sequence of constants (ttfc)fc>i satisfies the recursion a\ = I, 

a k = a,_ x ^ f fc -y- 2W2 ^-iV^ _ 1)1/2, fc > 2 . (2 .3) 



Theorem 2.1 states that the hopcount H n converges to the optimal value of the function x t— > g s (x) 
defined in (2.1), while the rescaled and recentered minimal weight W n converges is distribution to a 
Gumbel distribution. We can intuitively understand this as follows. For fixed k, the minimal path 
of length k is similar to an independent minimum of copies of sums of k random variables E~ s . 
The number of independent copies is equal to the number of disjoint paths between vertices 1 and 
2, which is close to n k ^ 1 . While on K n , the appearing paths do not have independent weights, the 
paths that are particularly short are almost independent. Now, the independent problem can be 
handled in two steps. First, we analyse the behavior of the random variable Z k = E^ s + • • - + E^ S . 
In this analysis, the function g s appears in the lower tails of the distribution. Secondly, we study 
the asymptotics of the minimum of n k ~ 1 of such random variables, which can be seen to be 
of order g s (k)/ (log n s ). This explains why the minimal integer value of g s is the crucial value 
for the hopcount, while the minimum of a large number of independent random variables with 
distribution Y, properly rescaled and recentered, converges to a Gumbel distribution by standard 
extreme value arguments. This intuitively explains Theorem 2.1. The main difficulty in the proof 
is to handle the fact that the weights of paths in the complete graph are actually not independent, 
and we use Stein's method for the Poisson approximation to deal with the available dependence. 

Let us now deal with the case where s € S. 
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Theorem 2.2 (The special set S) Suppose s € S, so that both \s + 1] and [s + lj minimize 
g s (-) over Z + . Define a sequence of independent random variables (Ek)k>2> where, for any k > 2, 
Sfc has the Gumbel distribution 



F(E k >t) = exp (- jj^y/j t e R. ( 2 - 4 ) 

with (ak)k>2 is defined in (2.3). Then, 

(a) the weight W n of the optimal path satisfies that, as n — > oo, 

(logn) s+1 / 5* \ 1 plogg* d . / S Ls+ ij H rs+1] 
W,, — -r — + - log log n > mm 1 



sg* V (logn)V 2 2 VLs + lJ -1' [s + 1] -1 

where g* = g s ([s + lj) = g s (\s + 1]). 

f&J t/ie hopcount H n (s) satisfies that, as n — >■ oo, 

ff n ( S ) A ff*, 
= argmin{H fc /(A: - 1) : A; G { [s + lj , [s + 1] }}. 



Another quantity of interest is the distribution of optimal paths between vertex 1 and a se£ of 
vertices. In telecom, this is called multicast, since one source sends to a multiple number of users. 
This also follows from the analysis in the paper. We shall give a brief idea of the proof in Section 
4.6. This result is stated for s ^ S, but one could state an equivalent result for s G S as well. 
Before we state the result we need some notation. Recall that we used k*(s) to denote the unique 
minimizer of g s (-) over Z+. For fixed m > 1, let {??i}i<i< m denote independent copies of the 
Gumbel random variable defined in (2.4) with k = k*(s). 



Corollary 2.3 (Multipoint weights) Fix m distinct vertices say 2, 3, . . . m+1 in K n . Suppose 
s ^ S and let {Wn}2<j<m+i denote the weight of the optimal path from 1 to these vertices. Write 

n = 0og >•)*+' (wi - J^L) + ^llo g logn- ^- 1 )'° M -W , 



where k = k*(s). Then, as n —> oo, 



KKm • 



Organization of the paper: The paper is organized as follows. We first discuss the relevance 
of our results and techniques in Section 3. We shall then continue to prove the main results in 
Section 4. 



3 Discussion 



We now provide a discussion of the various concepts used in this paper and the relevance of the 
results. 

(a) Stochastic mean-field model of distance: This notion refers to the complete graph with 
exponential mean 1 edge weights. The model gives a simpler but mathematically more tractable 
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model of distances between random points in high dimensions. While one can consider other edge 
distributions, the lack of memory property of the exponential random variable allows one to give 
clean proofs in a number of different contexts, including first passage percolation, see [14] where 
this property is used to great effect to derive refined asymptotics. We also refer to [1] where 
many other computations are derived in this context with the help of a powerful infinite random 
structure called the Poisson weighted infinite tree. 

(b) Weak and strong disorder: The last few years, with the availability of an enormous 
amount of data on real-world networks, has also witnessed an explosion in network models for 
these real-world networks as well as dynamics on them. Physicists have been highly interested 
in understanding the effect of random disorder on the various flow carrying properties of these 
network models. Via simulations, they have predicted a number of fascinating phenomena in 
these networks. Regarding the notions of weak and strong disorder mentioned in this paper, we 
refer the interested reader to [12], [8], [9] and [17] and the references therein. 

(c) First passage percolation: First passage percolation problems have been of great interest 
to probabilists for quite a while now, not just because of their origin from physical motivations 
of modeling disordered random flow systems, but also because this process and it's variants (e.g. 
oriented first passage percolation and last passage percolation) arise as basic constructing blocks 
for more complicated problems such as the contact process. There has been an intensive study 
of this model on the ci-dimensional lattice (see e.g. [15], [16] and [13]). The case of the complete 
graph with exponential edge weights was analyzed in [14], where in particular it was proved that 
the weight and hopcount of the optimal path satisfy: 

nW n — logn — > E, 

and 



as tl — y oo , where E G R is a non-degenerate random variable. 

In the last few years, due to the connections to real-world networks described above, these ques- 
tions have taken on an added significance and a number of studies both at the non-rigorous level 
([8]) and rigorous level (see e.g. [7]) have been undertaken to study such questions in many other 
random graph models. 

(d) Proof techniques: A number of different techniques have been used in the analysis of first 
passage percolation asymptotics in various contexts, ranging from subadditive methods in the 
context of the lattice, to continuous time branching process embeddings and renewal theory in 
the context of various random graph models. In particular, for s > 0, [6] used embeddings into 
a particular continuous-time branching process theory to derive the results in (1.3) and (1.4). 
As far as we know, this is the first paper that uses Stein's method for Poisson approximation 
to derive refined asymptotics in the first passage percolation context. In particular the results 
here complete the program started in [6] and show that for s < 0, H n = O e (l) while for s > 0, 
H n ~ slog?i. Clearly, this further shows that there are at least two universality classes for first- 
passage percolation on the complete graph in terms of the edge weight distribution. When s < 0, 
the weights E s are in the same universality class as the weight 1, in the sense that the hopcount 
remains bounded, while for s > 0, they are in the same universality class as the exponential 
distribution arising for s = 1 . As discussed in more detail in [6] , this raises the question what the 
universality classes for first passage percolation on K n are. In particular, does H n always satisfy 
a central limit theorem whenever H n — > oo? Or are there classes of edge weight distributions 
where the behavior is even different? For example, is there a class of random edge weights where 
the behavior is similar as for the minimal spanning tree, where H n is of the order ra 1 / 3 . 

(e) Multi-point distances and exchangeability: The classical probability theory of exchange- 
ability has in the last few years been used to analyze various complex random structures, see [3] 



6 



for a nice modern survey. In the context of Corollary 2.3, one can analyze such questions in a 
number of different contexts (such as the stochastic mean-field model of distance or your favorite 
random graph model with your favorite random edge weights). For the stochastic mean-field 
model, the multi-point optimal path weights converge (after proper rescaling and recentering) to 
an exchangeable sequence of random variables. In the present model, we can once again show 
convergence but to an independent sequence of random variables. 

4 Proofs 

This section contains the proofs of the main results. Throughout the paper, we make use of the 

following standard notation. We let denote convergence in distribution, and convergence 
in probability. For a sequence of random variables (X n ) n >i, we write X n = O p (1) when \X n \ is 
a tight sequence of random variables as n — > oo, and X n = o P (l) when \X n \ — > as n — > oo. 
For a non-negative function n \— > g(n), we write f(n) = 0(g(n)) when \f(n)\/g(n) is uniformly 
bounded, and f(n) = o{g(n)) when lim n _ i , 00 f(n)/g(n) = 0. We let Exp(A) denote an exponential 
random variable with rate A and Poi(A) a Poisson random variable with mean A. Finally, we 
write that a sequence of events (£ n )n>i occurs with high probability (whp) when P(£ n ) — > 1. 

Let us first give an outline of the proof. In Section 4.1, we shall show that the hopcount H n is 
a tight sequence of random variables as n — > oo. In Section 4.2, we shall derive the asymptotic 
behavior, for z ], 0, of the distribution function Fk(z), where 

F k (z) = P(E^ S + E^ s + . . . + E^ s < z), (4.1) 

and where E\,E2,... is an independent and identically distributed (i.i.d.) sequence of Exp(l) 
random variables. Denoting the weight of the minimal path with exactly k edges between vertex 
1 and vertex n by Wk(n), we then show that, uniformly in k > 2 and any < e < 1, the following 
statement holds whp, 

(logn) s W k (n) > (1 - e) 9s (k), (4.2) 

where g s (x) is defined in (2.1). 

The inequality (4.2) yields a first order lower bound for all values of k > 2. We will show in the 
paper that the function g s (x) determines the first order asymptotics of the weights Wk(n). The 
behavior of g s near its minimum value determines the asymptotic behavior of the hopcount H n . 
Roughly speaking, the hopcount H n will converge in probability to the integer k = k*(s) that 
minimizes the function g s (x), x > 2. The above statement about the convergence of H n is true 
for every s > for which the minimizing integer of g s (x) for x > 2 is unique, i.e., s S. In 
Section 4.3, we study the minimum of an independent number of sums of n k ~ 1 random variables 
with distribution E~ s , and use it to complete the proof of Theorem 2.1 when < s < 1. In 
Section 4.4, we extend the analysis to s > 1 and complete the proof of Theorem 2.1 by studying 
the second order asymptotics of the minimal weight of paths of length k in the complete graph 
K n . 

For s G S, to decide whether the hopcount H n converges in probability either to [sj + lj or to 
\sj + 1] we need the second second order asymptotics of Wfc(n), which is carried out in detail in 
Section 4.5. In Section 4.6, we sketch the proof of Corollary 2.3. 



7 



4.1 Tightness of the hopcount 



Note that the minimal weight W n satisfies the following inequality: 

W n >H n - min E7 S , (4.3) 
l<i<n(n-l)/2 

where E$ ~ Exp(l), are independent. Since the maximum of n independent exponentials scales 
like (1 + o P (l)) logn, we obtain from (4.3) that whp 

W n > H n \og- s {n{n-l)/2){l + o r (l)). (4.4) 

On the other hand W n is at most equal to the minimal weight of all two-edge paths between 1 
and n. Here, a two-edge path is a path of the form 1 — > i — > n, i = 2, 3, . . . n — 1, so that: 

W n < min mr s + (E'J)-% (4.5) 

2<j<n— 1 J J 

where EL 2 < j < n — 1, and E'- , 2 < j < n — 2, are independent Exp(l) random variables. It is 
not hard to verify, see Lemma 4.1 in Section 4.2, that (4.5) implies that whp, 

W n < — ^— . (4.6) 
(log n) b 

The inequalities (4.4) and (4.6) together imply that whp, 

H n = T {\). (4.7) 

We conclude that H n is a tight sequence of random variables. The remainder of the proof will 
reveal that in fact H n converges in distribution, either to a constant k*(s) when s S, or to a 
random variable giving positive mass to two values when s £ S. 



4.2 The first-order lower bound 



We start with an investigation of the distribution function of an independent sum of inverse 
powers of exponentials, i.e., 

Z k = E^ s + ... + E~ S . (4.8) 



Lemma 4.1 (Sums of inverse powers of exponentials) Fix s > and put p = 1/s. Then, 
for z — > 0, 

F k {z) ~ a k z-( k -We- kP+tz -\ (4.9) 

where a\ = 1 and (ak)k>2 is defined in (2.3) and where f(z) ~ g(z), z — > 0, means that 
lim^o f(z)/g(z) = 1. 

Proof. The result for fe = 1 is immediate from -Fi(-z) = e~ z P , z > 0. We proceed by induction. 
Suppose that (4.9) holds for some k > 1, then 

= pakZ-^-^z'P I* x-v-^l- x)-( h - 1 W 2 e-*- ph *W dx, 
Jo 
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where hk{x) = x p + k p+1 (l — x) p . The function h k has a saddle point at x = l/(k + 1), since 
h k (l/(k + 1)) = {k + l)P+\ h' k (l/(k + 1)) = 0, and 

h' k \l/{k + l))=p{p + l){k + l) p+2 {l + ^ 
Hence, from a standard saddle point approximation we obtain 

F k+1 (z) ~ pa k z-( k - 1 W 2 z- p [ 1 x- p - 1 {l-x)-( k - 1 W 2 e- s ~ Ph -Wdx 



o 



pa k z^ k ^ p l 2 z~ p (k + If +1 (£;/(fc + i))-(k-i)p/2 e -*-*MV(*+i)), 



2vr 



which yields the result. ■ 
Using the above lemma, we obtain the following first-order lower bound for W k {n): 

Theorem 4.2 (First-order lower bound) Fix s > 0. For each < e < 1, with the function 
g s defined in (2.1), whp and uniformly in k € {2, 3, . . . , n — 1}, 

(\ogn) s W k (n) > (l-e) 9s (k). 

Proof. Fix s > and 2 < k < n, and define, for < e < 1, 

x Kn = x Kn {e) = (l-e)^^- s . (4.10) 

Let N^\x), x > 0, be the number of paths between 1 and n with exactly k edges and weight at 
most x. Then, according to Lemma 4.1, for x ! 0, 

ElN^(x)] = I ~ ("I J)a fe x-^^/ 2 e- fcP+1 - P . (4.11) 

Note that 1 0, and that the term x k ^ ^ p l 2 blows up only polynomially fast, while exp{— k p+1 x k p n \ 

tends to exponentially fast. Using that (TZi) < n k ~ 1 and abbreviating = N k [ n \x k ^ n ), we 
reach to the conclusion that 

E[N£*] < n k ~ l eW {-k p+1 x- p n } = exp | - (^^J " l) (* " 1) log n\ . 

Boole's inequality and the Markov inequality together yield 
n— 1 n— 1 



¥(\J{(logn) s W k (n) < (1 - e)g s (k)}) < £ P((log n)'W fc (n) < (l-e) & (fc)) 

fc=2 fc=2 

n— 1 n— 1 oo ^ 1 

< E p K ?l) ) ^ !) ^ E E K n) ] ^ E ex p -(??— ^ - 1 ) fctog5 

fc=2 fc=2 fc=l ^ ^ ' 



Since the summand on the right-hand side is of order n pek , we may conclude that the probability 
that (log n) s W k {n) < (1 — e)g s (k), for some 2 < k < n — 1, tends to as n — > oo. ■ 
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4.3 Second order asymptotics 

In this section we shall identify the second order asymptotics for the independent minimum of n k ~ l 
random variables, where each of these random variables has distribution function F k (z), z > 0. 
The proof of Theorem 2.1 for < s < 1 follows quite easily from this and the lower bound (4.2). 
The proof of Theorem 2.1 for s > 1 is postponed to the next section. 

We denote by 

^f d) = min Y hJt (4.12) 

where Y^i, . . . ,Y k n k-i are i.i.d. with distribution function Fj.. The following theorem derives the 
asymptotics of W k '■ 

Theorem 4.3 (Minimum for independent r.v.'s) The minimal weight W k ni) defined in (4.12), 
satisfies 

C^^'^-^)^^- {k ' " :) > ! ) - e ~ w ' (4 - 13) 

where (ak)k>2 is defined in (2.3). 

Proof. Denote by Ff. the survival function of the distribution function F^, defined in (4.1), i.e., 
Fk{z) = 1 — Fy.{z). We compute z n = z n (t), such that 

F t~\zn) expl-afce*}. 

This is equivalent to 

n^ 1 log{l - F k (z n )} -»• -o fc e*. 
Since log{l — Fk(z n )} ~ —F k (z n ) for every k fixed and for z n — > 0, we find in turn that 

n^F^Zn) -> a k e\ (4.14) 

or 

(k - 1) log n + log{F k (z n )} -»• t + log a fc . (4.15) 

Put z n = K(logn) _s + C n (i), w here k = g s {k). From Lemma 4.1, we find that (4.15) is equivalent 
to 

(k - l)logn - (k - l)p/21og ( K (logn)~ s + C„(t)) - F +1 (/c(log n)~ s + Cn(t))" P -> (4.16) 
Writing 

K(logn)~ s + Cn(i) = K(logn)~ s (l + C„(t)(log n)7«) , 

yields 

(k - l)logn - (fc - l)p/21og ( K (logn)- s (l + C„(i)(logn)7«)) 

-F +1 K - p logn • (1 + C„.(t)(logn)7K) _p -»• t. 

Using that k p+1 K~ p = k — 1 and ps = 1, we arrive at 

(fc - 1) log n + (fc - l)/2 log(log n) - (k - l)p/2 log (l + Cn(*)(log n)*/«) 
— (fc - 1) logn • (l + ( n {t)(logn) s /K)- p ->t+{k- l)p/21og/e. 
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Now we choose 



Cn(t) = (logn)- s - 1 -(Cloglogn + / l ( i )) or Cn (t) (log n) s = ^^±M1. ( 4 .17) 



Then, 



and 



resulting in 
Hence, 



(k - l)p/21og (1 + CnW(logn)VK) = 0(^^) - 



l)logn- (l + C„(t)(logn)7*s 



logn 



-(fc-l)logn- (l-Kn(t)(logn)7«) 



-{k — 1) logn + 



l)p 



(Clog logn + /i(t)), 



(k-l)ph(t) I,, 
( = -n/2p and 1 — = t + -{k - l)p\ogn. 

K 2 



(4.18) 



z n (t) = g s (k)(log n)~ s + Cn(t) = 5s(A:)(log n)~ s + (log n)" 5 " 1 • (C log log n + h(t)) 



9s(k) g s {k) 



(logn) s (logn) s+1 



log log n t log g s (k) 

2p (k-l)p 2 



(4.19) 



We now turn to the proof of Theorem 2.1 in the case where < s < 1: 

Proof of Theorem 2.1 in case < s < 1. Observe from Theorem 4.2 that, whp and uniform 
in k > 3, 

(logn) s W k (n) > (1 - e) 9s (3) > g s (2), (4.20) 

where the latter inequality follows since for the indicated values of s, the function g s is increasing 
on [2, oo), and where we can take e < mino< s <i [1 — g s (2)/g s (3)] = 1/9. On the complete graph 
with n vertices the paths of length 2 have independent total weight, since they are disjoint. The 
number of paths of length 2 is equal to n — 2 ~ n, so that we can conclude from the previous 
theorem that for, any e > and whp, 

(logn) s W n (2) E (g s (2)-e,g s (2) + e). (4.21) 

From (4.20) and (4.21) it is immediate that W n = W n (2), whp, and hence 

lim H n (s) 2, 

n— >oo 

for < s < 1. Similarly, since W n = W n (2), whp, statement (b) of Theorem 2.1 follows from 
(4.13) for < s < 1 and k = 2. m 



4.4 The case s > 1 



In this section we treat the case s > 1. The number of paths with k edges between the vertices 1 
and n is equal to ClZi) ~ n k ~ l j(k — 1)!. Let S k {n) denote the set of all such paths. As before, we 
let .Ffc denote the distribution function of the sum of k independent random variables each with 
distribution equal to the distribution of E~ s , and by Nj^'(z), z > 0, the number of paths with k 
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edges which have total weight w s {P) = ^2 e£ -pE e s less than z. Recall the definition of z n (t) in 
(4.19) From Theorem 4.3 and its proof (compare (4.14)), we conclude that, as n — > oo, 

Xi n \t) := E[N£\z n (t))} ~ n fc - 1 F fc (z n (t))/(A: - 1)! -> a fc eV(fc - 1)!. (4.22) 

We shall prove the following proposition: 

Proposition 4.4 (Poisson approximation for small weight paths) Fix s > 1 and let Z^' (t) 

be a Poisson random variable with mean X£ (t) . Then, both for k = [s + lj and k = \s + 1] , as 
n — > oo, 

d TV (Nj;\z n (t)),zi:\t))^o, 

where d TV denotes the total variation distance. 



Assuming the proposition let us first show how to complete the proof of Theorem 2.1. 

Proof of Theorem 2.1 in case s > 1 and s £ S. Observe that 

P(W fc (n) > z n (t)) = F(N^(z n (t)) = 0). (4.23) 

Now Proposition 4.4 together with (4.22) implies that, for k = [s + lj and k = [~s + 1] , as n — > oo, 

F(W k (n) > z n (t)) -> exp(-o fc e*/(fc - 1)!). (4.24) 

Note that the weak convergence shows in particular that (log n) s Wk(n) converges in probability 
to g s (k) for the two indicated values of k. This together with the lower bound proven in Theorem 
4.2, and an argument similar to the case < s < 1 then completes the proof of Theorem 2.1, in 
case the integer that minimizes g s (x) is unique, i.e., in case s ^ S. m 

Proof of Proposition 4.4. We apply [5, Theorem l.A] with index set Sk(n), the set of paths 
between 1 and n having precisely k edges. For a € Sk(n), we denote by 

la = Ia(Zn(t)) = l{w s (a) < Z n (t)}, (4.25) 

where as before w s (a) = ^2 e€a E~ s , denotes the weight of the path a, and and where 1{^4} 
denotes the indicator of event A. Furthermore, let p^\t) denote the expectation of I a (z n (t)), 
i.e., 

pW(t) = P(w,{a) < z n (t)) = F k (z n (t)). (4.26) 

Let I* (a) C Sk(n) denote the set of paths (not including a) which have at least one edge in 
common to a (i.e., X*{a) is the set of paths f3 for which In is 'strongly' dependent on I a ), and 
let S* (a) C Sk(n) denote the set of paths that do not overlap on any edge with a. Note that the 
random variable w s (ot) is independent of {w s (/3): (3 £ S*(a)}. Finally, let 

Z a = Y, lR(/3) < 2n(*)}. 

pel* (a) 

By the independence w s (a) and {w s (/3): (3 S <S*(a)}, [5, Theorem l.A] implies that 



d TV (N^(z n (t)),Z^(t)) < 



P[:\t)+nz a } + ^^, (4.27) 
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where the last equality follows since \ k n \t) = |<Sfc(n)|pL n) (i) and since E[/ a Z a ] is independent of 
a. As before, by the choice of z n (t), 

„*-ipW(t)->a fc eV(fc-l)! 

Thus, in particular, p k (t) — > 0, as n — > oo. Further, there exists a constant C k such that, as 

n — > oo, 

E[Z(a)] = |Z>)K ra) (t) < C fc n fe "X n) (t) -> 0. 

Thus, the first two terms in (??) are negligible as n — > oo. The last term requires some more 
analysis. 

We note that 

k-2 

E[I a Z a ] = Y,\l*kA a Mj( t )- ( 4 - 28 ) 

Here X| .(a) C S k (n) consists of the set of paths of length k which overlap with a in exactly j 
edges, while 

pjg(i) = P(X M < z n (t),X kJ < z n (t)), 

where A% fc = £jj =1 while = £ J r=1 £~ s + Er= J+ i K s , 1 < j < fc - 2, and (^)Li and 
{Er)r=i are t wo independent vectors of i.i.d. Exp(l) random variables. We bound the probability 
Pkj(t) in the same way as before, using the saddle point approximation: 

Lemma 4.5 (Correlated sums of inverse powers of exponentials) Fix k > 3 and 1 < j < 

k — 2. Then, for z \. 0, 

F(X ktk < z, X kd < z) ~ C fc|i ^ (fc _ ._^ + . p/2 exp (-*"*[(*: - j> + j7 +1 ) . 
ffere v = 2 1 l^ +1 ^ and C kJ > is a constant. 

Proof. The proof is given by straightforward computation using the saddle point approximation: 
P(X kjk < z, X k>i <Z )=¥(J2 E~ s <z,Y^ E; s + ( E r)~ S < z )= F Li(z ~ V) dFi{y) 

r=l r=l r=i+l 

(a^z - y )-(^-- 1 W 2 e-( fe - i ) P+1 ^-^" I, ) 2 da iZ -( i - 1 >/ 2 e- iP+1 ^ J ' 
Jo 

= ttitt^ f y-(^lW2-le- P+1 «- p (pi^-p - (f - l)p/ 2 ) (z - yy^-Dp^k^Y+^z-y)^ dy 

Jo 

x (1 - x)-^-*- 1 )* exp{-z- p /i fcii (x)} dx, 
where we abbreviate 

^fc,i(ac) = + 2(ife - - x)~ p . 

i 

Put = 2P+ 1 . Then, the saddle point arises in the point x kt i, satisfying h' ki (x k ^) = 0, which 
yields 

i 

x k,i = Ti ^ I 
[k — %)v + i 
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Furthermore, hk,i(x kt i)) = {(k — %)v + i) P+1 , while 



Applying the saddle point approximation then yields: 

P(X fc , fc < z,X kii < z) ~ a i 4_^- (fep - (i+3)p/2) (^ifc ) i)~ (i " 1)p/2 " 1 (^ P+1 (^it ) i)~ P " (» - 1)p/2) 



x(l - Xfc,,)"^- 1 ^ exp{-^/ iM (x fe)i )} 1 



2vr 



(4.29) 



Recall (4.27). The first two terms on the right-hand side are negligible, hence it suffices to show 
that 

E[I a Z a ] 







Using that = 0(n"( fe_1 )) and that ^-(a)] ~ it follows from (4.28) that we now 

need to show for 1 < j < k — 2, 



rP-'-V&t) -> 0, 



as n — >■ oo. Now the polynomial terms (z fc type terms) in the saddle point approximation of 
p k "j(t) should not play a role. Thus, using the fact that up to the first order 

9s(k)\-P (fc-l)logn 



(*»(*))" 



log 5 n 



(4.30) 



we need to show that for 1 < j < k — 2 and with v = 2 1 ^ p+1 \ 



1 3\ 



p+i 



fc-i 



> 0. 



(4.31) 



The above inequality is not true for s close to and larger values of k. However, it is true for 
s > 1 and k € { [s + lj , [~s + 1] } as we will now show. Indeed, define, for x G [0, 1], 



u k (x) = \{l - x)2 s /( s+1 ) + 



(i+V«) 



A- 



Jfe — 1 



and note that Uk(j/k) is equal to the left side of (4.31). Hence, if we show that for both k = [s+l\ 
and k = \s + 1], the function tifc(x) > for all x G (0,1), then we are done. Differentiating 
x h-» Ufc(x) with respect to x yields 



u' k {x) 



'l-x)2 1 ' a + x 



- 1 &/*-!)+ k 



k-l 



where a = (s + l)/s > 1. The function v! is increasing as can easily be seen from the second 
derivative 



a-2 



u'l(x) = a(a - 1)(1 - 2 1 ' a ) 2 {l-x)2 l ' a + x > 0. 
Hence, we have to show that u' k (0) > 0, for the two indicated values of k. 
Claim: Fix s > 1, then the condition u' k (0) > is true for both k = [s + lj and k = \s + 1] . 
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Proof of claim. Since a = (s + l)/s, the condition u' k (0) > is equivalent to 

_i_ > £±i (2 _ 2 ./(. + i) ) . 

Now substitute either k = [s + lj or k = \s + 1] and simplifying the claim u'(0) > shows that 
it is equivalent to 

sk(2 s /( s+ ^ - 1) > (jfe - (s + 1))(2 - 2 S /( S+1 )). 

This inequality is trivially true for k = [s + lj , since then the right side is negative, whereas the 
left side is positive. For k = \s + 1] , the left side is at least 3(2 S// ( S+1 ) — 1), whereas the right side 
is at most 2 — 2 S ^ S+1 \ while s > 1. Furthermore, again using that s > 1, 

3(2«/(»+i) _ !) > 3(^/2 _i) >2 -^/2>2 - 2 S ^ S+1 \ 

This shows that the above claim holds and hence that the Poisson approximation holds both for 
k = [s + lj and k = \s + 1] . ■ 



4.5 The case s G S, the special set 

In this section, we will prove Theorem 2.2. To this end, we fix sj S S and denote by k = [sj + lj , 
so that k + 1 = \sj + 1], Let = (z n (x)) denote the number of paths from 1 to n of 
length k and with weight at most z n (x), with z n (x) given by (4.19), and similarly we denote by 
M^ n) = Mu (z n (y)) the number of paths from 1 to n of length k + 1 and with weight at most 
z n (y), where z n (y) is given by the right-hand side of (4.19), with t replaced by y and k by k + 1. 
Note that the change from k to k + 1 is for many aspects irrelevant, because for s = Sj, we have 
g s (k) = g s (k + 1). We are therefore in particular allowed to use the same quantity z n (y) in the 
definition of . We will show below that the total variation distance between iV^ n) + M^ l) and 
a Poisson variable with mean /j,^\x,y) = K[N^ + Ml ] converges to as n — > oo, i.e., we will 
show that 

d TV (Nl n) + M£\Voi(^ ] {x,y))) -> 0. (4.32) 
Let us first prove that (4.32) implies Theorem 2.2. Indeed, (4.32) implies that 

F(W k (n) > z n (x), W k+1 (n) > z n {y)) = ¥{N { k n) = 0, M< n) = 0) = P(A^ + M ( k n) = 0) 

-> P(Poi(/i fe (rE,y)) =0), 

where Hk{x,y) = lim^oo/x^fa;,^) = A fc (x) + A fc+ i(y), by (4.22), and where we define 

Ai(z) = a/e 2 /(/ - 1)!, I > 1, z > 0. (4.33) 

Thus, 

lim P(W fc (n) > «„(x), W fc +i(n) > z n (y)) = lim P(W fc (n) > z„(x)) lim P(Wfc+i(n) > ^(y)). 

n— Yoo n— >oo n— >oo 

(4.34) 

Consequently, comparing with (4.23), we see that the events {Wk{n) > z n (x)} and {Wk+i{n) > 
z n (y)} are asymptotically independent. It is then straightforward to conclude that the normalized 
optimal weight W n in Theorem 2.2 converges in distribution to the minimum of the independent 
pair (Sfc/(A; — 1), S^+i/k). The lower bound for (logn) s Wk(n) of Theorem 4.2 again completes 
the proof of part (a), and subsequently also part (b), of Theorem 2.2. 

In order to prove (4.32), we rely again on the Poisson approximation in [5], Set Tk(n) = Sk(n) U 
<Sfc+i(n) the index set of all paths from 1 to n having either k or k + 1 edges, where as before 



15 



k = \_Sj + lj . To denote that the length of a path is equal to A;, we give it a subscript k and write 
a k for an element of S k (n). For a k G S k (n), we denote by 

Ia k = I ak (z n (x)) = l{w s (a k ) < Z n (x)}, 

whereas for a path a k+ \ G S k+ i(n), we define 

Ia k+1 = Ia k+1 {Zn{y)) = l{w s {a k+l ) < Z n (y)} , 

so that 

pf(x) = PK(a fc ) < z n {x)) = F k {z n (x)), vtlM = ^KK+i) < *»60) = F fc+1 (z n (y)). 

Writing a for ajt or a k +i, we denote by 2/* (a) C Tk(n) the set of paths (not including a) which 
have at least one edge in common to a , and by S* (a) C T k (n) the set of paths that do not 
overlap on any edge with a. Finally, let 

z a = Yl HMPk) < z n (x)}+ Yl HMPk+i) < z n {y)}- 

/3 k eZ*{a) p k+1 ez*{a) 
The total variation distance in (4.32) is bounded by 

^aes k (n) + p£°(*)E[Z ] +E[I a Z a ]] E a es k+1 (n) MlM? + P ( k n li(y)nZ a ] +E[I a Z 



+ 



Since 

tf\x,y)=p£\x)\S k (n)\ + > m^^x^S^lp^y)^^}, 

we conclude from the proof of Proposition 4.4 that 

H ( k n) (x,y) ^ k \x,y) 
Hence, it remains to prove that 

We next decompose E[7 Q ,Z Q ,] into the part where j3 has k or k + 1 edges, i.e., 

E[/ a Z a ]= J] P(/ a = l,/ & = l)+ ^ P(/ Q = 1,I &+1 = 1). 

I3 k £l*(a) k+1 £Z*(a) 

By making this decomposition, as well as differentiating between the number of edges of a, the 
numerator in (4.35) splits into 4 different double sums. The two double sums running over the 
index sets a G S k (n),[3 k G and a G <Sfc+i(n), G 2/*(a) are treated in the proof of 

Proposition 4.4, apart from the small change that z n (x) and z n {y) are now possibly different. 
Since z n (x) / z n (y) — > 1, it is straightforward to adapt the argument. Below, we will show that 

Eqg«S fc (n) E/3 fc+1 eZ*(a) ^(^ = = l ) , , 

f*k ix,y) 

The term with a G <Sfc+i(n) and (3 k G X* (a) is identical, apart from the fact that x and y are 
interchanged. Thus, (4.36) completes the proof of (4.32). 
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To prove (4.36), we write, as in (4.28), 

P(ia = = 1) = E K+ljNbKlJ^^)' 

/3 k+1 el*(a) j=l 

where 2^ +1 .-(a) C T k (n) consists of the set of paths of length k + 1 which overlap with a, which 
has length k, in exactly j edges, while 

Pkl ld {x,y) = F(X k:k < z n (x),X k+ld < z n (y)), 

where, similarly as in the proof of Proposition 4.4, we now write X kyk = Ylr=i ^r S ^ while -Xfc+ij = 
£U E~ s + E~ s , l<j<k-l. 

By adapting the saddle point approximation used in the proof of Lemma 4.5, it is readily verified 
that for z±, z-i i- such that lim^^o z%jzi = 1, and k > 3 and 1 < j < k — 1, we have 

F(X kik < zi,X k+1>j < z 2 ) = exp {-z^ p [(k - j)u + j + l] p+1 (l + o(l))) . (4.37) 

where, as before, v = 2 1 ^ P+1 K By (4.30) (z n (x))~ p ~ ^"^i+° g "' - Further, since s E 5, we have 
that (z n (y))" p ~ (fc +f)i+ P = (fc ~ fc i+ 1 ° g "' - Thus, lim n ._ > . 00 z n (y)/z n (x) = 1, as required. 

We conclude that in order that (4.36) holds we need to show that n _J_1 f4+i At) — > 0, or 
equivalently that 



exp — log n 



{1 -k )u+i -ir 



p+i 



(2k -3-1) 



-> 0, 1 < j < k - 1. 



This follows from the facts that v = 2 1// ( p+1 ) < 2 and p > 0, since, for 1 < j < k — 1, 



(2fc - J - 1) 



7-1, 7-l\P+l 
< ((1- J —— )v + J — 



i-i 
it- l 

j. ,j + l , z/-2\p+ 



+ 



fc 



This proves (4.36), and, thus, completes the proof of (4.32). 



4.6 Multipoint distance limits 

In this section, we discuss how to prove Corollary 2.3. We shall indicate how to prove this 
corollary for 2 multipoint distances. The case for general m follows similarly. 

More precisely, let W% , W% denote the recentered and rescaled optimal weights between 1 and 2 
and 1 and 3. Recall, for any fixed t 6 R, the function z n (t) from (4.19), where we take k = k*(s). 
For j = 2,3 and any t 6 R, let N 3 k ^ n \z n (t)) denote the number of paths between 1 and j having 
k edges whose weight is less than z n (t). 

The proof of Corollary 2.3 will be an adaptation of the proof of Theorem 2.2 in Section 4.5, and 
we start by recalling some results we have proved and shall rely on. Recall that we have already 
proved that, as n — > oo, 

\[?At)=E(Nt n) (zn(t)))~\k(t), 
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where X k (x) = a k e x /(k — 1)! is defined in (4.33) and 

d TV (Nl' { "\z n (t)),Poi(X k (t)))^0. 

For any fixed x,y 6 R, define 

iV* = N l k ^\z n {x)) + Nl^\z n {y)). (4.38) 

Below, we shall show that 

N*^Poi(X k (x) + X k (y)). (4.39) 

Then, the argument leading to (4.34) implies that and W% are asymptotically independent, 
so that 

lim F(W? > z,W? >y)^ exp(-X k (x) - X k (y)), 

n— >oo 

establishing the result we want. We next sketch how one can prove (4.39). 

Idea of the proof of (4.39). Fix any path a with k edges between 1 and 2 and path ft between 1 
and 3. Since the argument is quite close to the proof of Theorem 2.2, we shall keep the discussion 
brief and focus on the differences. We again rely on the total variation bound [5, Theorem l.A] 
that implies 

d TV (K,Poi(X k "\x) + X£\y))) < (/) + (//) + (///) 



Here, 



and, as in (4.25) 



X ( k n) (x)+X^(y) ■ 

(I) = pP(x)X k n \x)+p k n \y)X^(y), 
(II) = (E[Z 1 a ] + E[Zl})Xi n \x) + (E[Z}} + E[Zj])X[:\y), 
(III) = E[I a Zl + I a Zl + I p Zl + I p Z% 

I a = l{K(a)| < z n (x)}, I p = l{\w 8 (ft)\ < z n (y)}, 



while, writing I* (a) for the set of paths from 1 to 2 which overlap with a and X^(a) for the set 
of paths from 1 to 3 which overlap with a (and similarly for ft), 

Zl= H\ws(l)\ < z n (x)}, and Z 2 a = £ 1{K( 7 )| < z n (y)}, 

7eX*(o) 7611(a) 

and similarly for Z^ and Zj. Now, we have already shown that the terms (I) and (II) rescaled 
by the means are negligible as n — > oo. Thus, to complete the proof we just need to show that, 
as n — > oo, 

X^(x) + X^(y) 

(and the corresponding term for the path ft), namely the contribution of terms consisting of the 
overlap of the path a (which goes from 1 to 2) with paths which go from 1 to 3. This is a minor 
adaptation of the proof of (4.36), and we omit the details. ■ 
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